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Abstract 



Dynamic connectivity is a well-studied problem, but so far the most compelling progress has been 
confined to the edge-update model: maintain an understanding of connectivity in an undirected graph, 
subject to edge insertions and deletions. In this paper, we study two more challenging, yet equally 
fundamental problems: 

Subgraph connectivity asks to maintain an understanding of connectivity under vertex updates: 
updates can turn vertices on and off, and queries refer to the subgraph induced by on vertices. (For 
instance, this is closer to applications in networks of routers, where node faults may occur.) 

We describe a data structure supporting vertex updates in 0{m^/^) amortized time, where to denotes 
the number of edges in the graph. This greatly improves over the previous result [Chan, STOC'02], 
which required fast matrix multiplication and had an update time of 0{mP '^^). The new data structure 
is also simpler 

Geometric connectivity asks to maintain a dynamic set of n geometric objects, and query connec- 
tivity in their intersection graph. (For instance, the intersection graph of balls describes connectivity in a 
network of sensors with bounded transmission radius.) 

Previously, nontrivial fully dynamic results were known only for special cases like axis-parallel line 
segments and rectangles. We provide similarly improved update times, 0{rp/'^), for these special cases. 
Moreover, we show how to obtain sublinear update bounds for virtually all families of geometric ob- 
jects which allow sublinear-time range queries. In particular, we obtain \hs first sublinear update time 
for arbitrary 2D line segments: 0*{n^/^^); for c?-dimensional simplices: 0*{n^~ rfc^d+i) ); and for d- 
dimensional balls: O* {n^^Td+wjd+i)). 



1 Introduction 



1.1 Dynamic Graphs 

Dynamic graphs inspire a natural, challenging, and well-studied class of algorithmic problems. A rich body 
of the STOC/FOCS literature has considered problems ranging from the basic question of understanding 
connectivity in a dynamic graph |fT3l [TTl [34l l6l |3T|. to maintaining the minimum spanning tree 120 ], the 
min-cut f361, shortest paths |l9l|35l, reachabihty in directed graphs ||l0l|25l|26l[32l|33l, etc. 

But what exactly makes a graph "dynamic"? Computer networks have long provided the common 
motivation. The dynamic nature of such networks is captured by two basic types of updates to the graph: 

• edge updates: adding or removing an edge. These correspond to setting up a new cable connection, 
accidental cable cuts, etc. 

• vertex updates: turning a vertex on and off. Vertices (routers) can temporarily become "off" after 
events such as a misconfiguration, a software crash and reboot, etc. Problems involving only vertex 
updates have been called dynamic subgraph problems, since queries refer to the subgraph induced by 
vertices which are on. 

Loosely speaking, dynamic graph problems fall into two categories. For "hard" problems, such as 
shortest paths and directed reachability, the best known running times are at least linear in the number of 
vertices. These high running times obscure the difference between vertex and edge updates, and identical 
bounds are often stated |l9l[32l[33j| for both operations. For the remainder of the problems, sublinear running 
times are known for edge updates, but sublinear bounds for vertex updates seems much harder to get. For 
instance, even iterating through all edges incident to a vertex may take linear time in the worst case. That 
vertex updates are slow is unfortunate. Referring to the computer-network metaphor, vertex updates are 
cheap "soft" events (misconfiguration or reboot), which occur more frequently than the costly physical 
events (cable cut) that cause an edge update. 

Subgraph connectivity. As mentioned, most previous sublinear dynamic graph algorithms address edge 
updates but not the equally fundamental vertex updates. One notable exception, however, was a result of 
Chan [6] from STOC'02 on the basic connectivity problem for general sparse (undirected) graphs. This al- 
gorithm can support vertex updates in tim^H 0{m^-^'^) and decide whether two query vertices are connected 
in time 0(m^/^). 

Though an encouraging start, the nature of this result makes it appear more like a half breakthrough. 
For one, the update time is only slightly sublinear. Worse yet, Chan's algorithm requires fast matrix multi- 
plication (FMM). The 0{m^'^'^) update time follows from the theoretical FMM algorithm of Coppersmith 
and Winograd fSl. If Strassen's algorithm is used instead, the update time becomes 0{m^'^^^). Even if 
optimistically FMM could be done in quadratic time, the update time would only improve to 0(m° '^^). 
FMM has been used before in various dynamic graph algorithms (e.g., lITOl l26l ). and the paper [6] noted 
specific connections to some matrix-multiplication-related problems (see Section |2ll. All this naturally led 
one to suspect, as conjectured in the paper, that FMM might be essential to our problem. Thus, the result we 
are about to describe may come as a bit of a surprise. . . 

'We use m and n to denote the number of edges and vertices of the graph respectively; O(-) ignores polylogarithmic factors 
and O* (■) hides factors for an arbitrarily small constant £ > 0. Update bounds in this paper are, by default, amortized. 
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(a) (b) 

Figure 1: (a) Is b reachable from a staying on the roads? (b) Do the gears transmit rotation from a to 6? 



Our result. In this paper, we present a new algorithm for dynamic connectivity, achieving an improved 
vertex-update time of 0{m?/^), with an identical query time of 0(m^/^). First of all, this is a significant 
quantitative improvement (to anyone who regards an m^'^'' factor as substantial), and it represents the first 
convincingly sublinear running time. More importantly, it is a significant qualitative improvement, as our 
bound does not require FMM. Our algorithm involves a number of ideas, some of which can be traced back 
to earlier algorithms, but we use known edge-updatable connectivity structures to maintain a more cleverly 
designed intermediate graph. The end product is not straightforward at all, but still turns out to be simpler 
than the previous method ||6J and has a compact, two-page description (we regard this as another plus, not a 
drawback). 

1.2 Dynamic Geometry 

We next turn to another important class of dynamic connectivity problems — those arising from geometry. 

Geometric connectivity. Consider the following question, illustrated in Figure[Ha). Maintain a set of line 
segments in the plane, under insertions and deletions, to answer queries of the form: "given two points o 
and b, is there a path between a and b along the segments?" 

This simple-sounding problem turns out to be a challenge. On one hand, understanding any local ge- 
ometry does not seem to help, because the connecting path can be long and windy. On the other hand, the 
graph-theoretic understanding is based on the intersection graph, which is too expensive to maintain. A 
newly inserted (or deleted) segment can intersect a large number of objects in the set, changing the intersec- 
tion graph dramatically. 

Abstracting away, we can consider a broad class of problems of the form: maintain a set of n geometric 
objects, and answer connectivity queries in their intersection graph. Such graphs arise, for instance, in VLSI 
applications in the case of orthogonal segments, or gear transmission systems, in the case of touching disks; 
see Figure [Hb). A more compelling application can be found in sensor networks: if r is the radius within 
which two sensors can communicate, the communication network is the intersection graph of balls of radius 
r/2 centered at the sensors. While our focus is on theoretical understanding rather than the practicality of 
specific applications, these examples still indicate the natural appeal of geometric connectivity problems. 

All these problems have a trivial 0{n) solution, by maintaining the intersection graph through edge 
updates. A systematic approach to beating the linear time bound was proposed in Chan's paper as well |6|, 
by drawing a connection to subgraph connectivity. Assume that a particular object type allows data struc- 
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tures for intersection range searching with space S{n) and query time T{n). It was shown that geometric 
connectivity can essentially be solved by maintaining a graph of size m = 0{S{n) + nT{n)) and running 
0{S{n)/n + T{n)) vertex updates for every object insertion or deletion. Using the previous subgraph con- 
nectivity result im, an update in the geometric connectivity problem took time 0{[S{n) /n + T{n)\ ■ [S{n) + 
nT{n)]^-^^). Using our improved result, the bound becomes 0{[S{n)/n + T{n)\ ■ [S{n) + nT(n)]^/^). 

The prime implication in the previous paper is that connectivity of axis-parallel boxes in any constant 
dimension (in particular, orthogonal line segments in the plane) reduces to subgraph connectivity, with a 
polylogarithmic cost. Indeed, for such boxes range trees yield S{n) = n ■ Ig'^^'^^ n and T{n) = Ig'^^'^^ n. 
Unfortunately, while nontrivial range searching results are known for many types of objects, very efficient 
range searching is hard to come by. Consider our main motivating examples: 

• for arbitrary (non-orthogonal) line segments in R^, one can achieve T{n) = 0*{^/n) and S{n) = 
0*{n), or r(n) = 0*(nV3) g^^^ ^ 0*{n'^/^) IM- 

• for disks in R^, one can achieve T(n) = 0*(n^/3) and S{n) = 0*{n), or T{n) = (9*(n^/^) and 
S{n) = 0*(n3/2) 13]. 

Even with our improved vertex-update time, the [S{n)/n + T(n)] • [S{n) + nT(n)]2/^ bound is too weak 
to beat the trivial linear update time. For arbitrary line segments in R^, one would need to improve the 
vertex-update time to m^/^^^, which appears unlikely without FMM (see Section[2l). The line segment case 
was in fact mentioned as a major open problem, implicitly in [6] and explicitly in yj. The situation gets 
worse for objects of higher complexity or in higher dimensions. 

Our results. In this paper, we are finally able to break the above barrier for dynamic geometric connec- 
tivity. At a high level, we show that range searching with any sublinear query time is enough to obtain 
sublinear update time in geometric connectivity. In particular, we get the first nontrivial update times for 
arbitrary line segments in the plane, disks of arbitrary radii, and simplices and balls in any fixed dimension. 
While the previous reduction f6l involves merely a straightforward usage of "biclique covers", our result 
here requires much more work. For starters, we need to devise a "degree-sensitive" version of our improved 
subgraph connectivity algorithm (which is of interest in itself); we then use this and known connectivity 
structures to maintain not one but two carefully designed intermediate graphs. 

Essentially, if T{n) = 0{n^~^) and S{n) = 0{n), we can support dynamic geometric connectivity 
with update time 0(n^~^^/(2"*'^^) and query time 0(n^/(2+^)). For non-orthogonal line segments in R^, 
this gives an update time of 0*(n^/^°) and a query time of 0*(n^/^). For disks in R^, the update time is 
O*(n20/2i), with a query time of 0*{n^/'^). 

Known range searching techniques 121 from computational geometry almost always provide sublinear 
query time. For instance, Matousek f2E\ showed that 6 « 1/2 is attainable for line segments, triangles, and 
any constant-size polygons in R^; more generally, 6 « l/d for simplices or constant-size polyhedra in R'^. 
Further results by Agarwal and Matousek 131 yield 6 + 1) for balls in R'^. Most generally, 6 > is 

possible for any class of objects defined by semialgebraic sets of constant description complexity. 

More results. Our general sublinear results undoubtedly invite further research into finding better bounds 
for specific classes of objects. In general, the complexity of range queries provides a natural barrier for the 
update time, since upon inserting an object we at least need to determine if it intersects any object already 
in the set. Essentially, our result has a quadratic loss compared to range queries: if T(n) = n^~'', the update 
time is n^"®^** \ 
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In Section m We make a positive step towards closing this quadratic gap: we show that if the updates are 
given offline (i.e. are known in advance), the amortized update time can be made n^^^^''\ We need FMM 
this time, but the usage of FMM here is more intricate (and interesting) than typical. For one, it is crucial 
to use fast rectangular matrix multiplication. Along the way, we even find ourselves rederiving Yuster and 
Zwick's sparse matrix multiplication result ll38l in a more general form. The juggling of parameters is also 

more unusual, as one can suspect from looking at our actual update bound, which is 0{n'^+°'-''"/'^), where 
a = 0.294 is an exponent associated with rectangular FMM. 

2 Related Work 

Before proceeding to our new algorithms, we mention more related work, for the sake of completeness. 

Graphs. Most previous work on dynamic subgraph connectivity concerns special cases only. Frigioni and 
Italiano [ 14] considered vertex updates in planar graphs, and described a polylogarithmic solution. 

If vertices have constant degree, vertex updates are equivalent to edge updates. For edge updates, Hen- 
zinger and King ifTTIl were first to obtain polylogarithmic update times (randomized). This was improved by 
Holm et al. [201 to a deterministic solution with 0(lg^ m) time per update, and by Thorup f34l to a random- 
ized solution with 0{lgm ■ (Iglgm)^) update time. The randomized bound almost matches the Q,{lgm) 
lower bound from |30]. All these data structures maintain a spanning forest as a certificate for connectivity. 
This idea fails for vertex updates in the general case, since the certificate can change substantially after just 
one update. 

In many practical settings, these planar-graph and constant-degree special cases are unfortunately inad- 
equate. In particular, large networks of routers are often designed as overlay graphs over a (small-degree) 
geographic graph. Long fiber-optic links bypass intermediate nodes, in order to minimize the latency cost 
of passing through the electric domain repeatedly. 

For more difficult dynamic graph problems, the goal is typically changed from getting polylogarithmic 
bounds to finding better exponents in polynomial bounds; for example, see all the papers on directed reach- 
ability ifTOl l25l l32l [33l . Evidence suggests that dynamic subgraph connectivity fits this category. It was 
observed l'6'l that finding triangles (3-cycles) or quadrilaterals (4-cycles) in directed graphs can be reduced 
to 0{m) vertex updates. Thus, an update bound better than ^/m appears unlikely without FMM, since the 
best running time for finding triangles without FMM is 0(m^/^), dating back to STOC'77 |[24l . Even with 
FMM, known results are only slightly better: finding triangles and quadrilaterals takes time 0(m^'^^) JU 
and 0{m^'^^) 1371 respectively. Thus, current knowledge prevents an update bound better than m'^-^^. 

Geometry. It was shown p6l that subgraph connectivity can be reduced to dynamic connectivity of axis- 
parallel line segments in 3 dimensions. Thus, as soon as one gets enough combinatorial richness in the host 
geometric space, subgraph connectivity becomes the only possible way to solve geometric connectivity. 

When the geometry is less combinatorially rich, it is possible to find ad hoc algorithms that do not rely 
on subgraph connectivity. Special cases that have been investigated include the following: 

• for orthogonal segments or axis-parallel rectangles in the plane, Afshani and Chan |T| proposed a data 
structure with update time 0{n^^^^^) and constant query time. This is incomparable to our result of 
update time 0(n^/'^) and query time 0{n^^^). 
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• for unit axis-parallel hypercubes, the problem reduces to maintaining the minimum spanning tree un- 
der the £00 metric. Eppstein fTTl describes a general technique for dynamic geometric MST, ultimately 
appealing to range searching, and obtains polylogarithmic time per operation. 

• for unit balls, the problem reduces to dynamic Euclidean MST, which in turn reduces to range search- 
ing by Eppstein's technique [llj. In two dimensions, Chan's dynamic nearest-neighbor data struc- 
ture Q implies an 0{lg^^ n) update time for this problem. 

Dynamic geometric connectivity is a natural continuation of static geometric connectivity problems, 
which have been studied since the early 1980s. As in our case, the main challenge is to avoid working 
explicitly with the intersection graph, which could be of quadratic size. Known results include 0{n\gn)- 
time algorithms 1221 123]| for computing the connected components of axis-aligned rectangles in the plane, 
and 0(n^/^) -time algorithms |[T6l l27l for arbitrary line segments in the plane. More generally, Chan [6| 
(and later Eppstein [12]) noted the connection of static geometric connectivity to range searching, which 
implied subquadratic algorithms for objects with constant description complexity. The connection carries 
over to the incremental (insertion-only) and decremental (deletion-only) cases Q, e.g., yielding 0(n^/'^) 
update time for arbitrary line segments, reproving and extending some older results [4]. 

Another related problem is maintaining connectivity in the kinetic setting, where objects move contin- 
uously according to known flight plans. See HH [191 for the case of axis-parallel boxes, and ifTsTi for unit 
disks. 

3 Dynamic Subgraph Connectivity with 0(m^/^) Update Time 

In this section, we present our new method for the dynamic subgraph connectivity problem: maintaining a 
subset S of vertices in a graph G, under vertex insertions and deletions in S, so that we can decide whether 
any two query vertices are connected in the subgraph induced by S. We will call the vertices in S the active 
vertices. For now, we assume that the graph G itself is static. 

The complete description of the new method is given in the proof of the following theorem. It is "short 
and sweet", especially if the reader compares with Chan's paper ||6l. The previous method requires several 
stages of development, addressing the offline and semi-online special cases, along with the use of EMM — 
we completely bypass these intermediate stages, and EMM, here. Embedded below, one can find a number 
of different ideas (some also used in [6]): rebuilding periodically after a certain number of updates, dis- 
tinguishing "high-degree" features from "low-degree" features (e.g., see B |37l), amortizing by splitting 
smaller subsets from larger ones, etc. The key lies in the definition of a new, yet deceptively simple, inter- 
mediate graph G*, which is maintained by known polylogarithmic data structures for dynamic connectivity 
under edge updates |[T7l|20l|34l. Except for these known connectivity structures, the description is entirely 
self-contained. 

Theorem 1. We can design a data structure for dynamic subgraph connectivity for a graph G = {V, E) 
with m edges, having amortized vertex update time 0{m?/^), query time 0{m}-/^), and preprocessing time 

Proof. We divide the update sequence into phases, each consisting of g := m/A updates. The active vertices 
are partitioned into two sets P and Q, where P undergoes only deletions and Q undergoes both insertions 
and deletions. Each vertex insertion is done to Q. At the end of each phase, we move the elements of Q to 
P and reset Q to the empty set. This way, \Q\i?, kept at most q at all times. 
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Call a connected component in (the subgraph induced by) P high if the sum of the degrees of its vertices 
exceeds A, and low otherwise. Clearly, there are at most 0{m/A) high components. 

The data structure. 

• We store the components of P in a data structure for decremental (deletion-only) connectivity that 
supports edge deletions in polylogarithmic amortized time. 

• We maintain a bipartite multigraph F between V and the components 7 in P: for each uv & E where 
V lies in component 7, we create a copy of an edge wy G T. 

• For each vertex pair u,v, we maintain the value C[u, v] defined as the number of low components in 
P that are adjacent to both u and v in F. (Actually, only O(mA) entries of C[-, ■] are nonzero and 
need to be stored.) 

• We define a graph G* whose vertices are the vertices of Q and components of P: 

(a) For each u,v ^ Q, if C[u, v] > 0, then create an edge uv ^ G*. 

(b) For each vertex u ^ Q and high component 7 in P, if 1x7 G F, then create an edge wj & G*. 

(c) For each u,v & Q, if uv G E, then create an edge uv G G*. 

We maintain G* in another data structure for dynamic connectivity supporting polylogarithmic-time 
edge updates. 

Justification. We claim that two vertices of Q are connected in the subgraph induced by the active vertices 
in G iff they are connected in G*. The "if" direction is obvious. For the "only if" direction, suppose two 
vertices u,v ^ Q are "directly" connected in G by being adjacent to a common component 7 in P. If 7 is 
high, then edges of type (b) ensure that u and v are connected in G*. If instead 7 is low, then edges of type 
(a) ensure that u and v are coimected in G*. By concatenation, the argument extends to show that any two 
vertices u,v e Q connected by a path in G are connected in G*. 

Queries. Given two vertices vi and V2, if both are in Q, we can simply test whether they are connected 
inG*. 

If instead vj {j G {1, 2}) is in a high component jj, then we can replace vj with any vertex of Q adjacent 
to 7j in G*. If no such vertex exists, then because of type-(b) edges, 7^ is an isolated component and we can 
simply test whether vi and f 2 are both in the same component of P. 

If on the other hand Vj is in a low component jj, then we can exhaustively search for a vertex in Q 
adjacent to in F, in 0{A) time, and replace vj with such a vertex. Again if no such vertex exists, then 7^ 
is an isolated component and the test is easy. The query cost is 0(A). 

Preprocessing per pliase. At the beginning of each phase, we can compute the multigraph F in 0(m) 
time. We can compute the matrix C[-,-] in O(mA) time, by examining each edge ^7 G F and each of the 
0(A) vertices u adjacent to a low component 7 and testing whether 'yu G F. The graph G* can then be 
initialized. The cost per phas^is O(mA). We can cover this cost by charging every update operation with 
amortized cost 0(mA/g) = O(A^). 
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Update of a vertex u in Q. We need to update 0{q) edges of types (a) and (c), and 0{m/A) edges of 
type (b) in G*. The cost is d{q + m/A) = 0(m/A). 

Deletion of a vertex from a low component 7 in P. The component 7 is spht into a number of subcom- 
ponents. Since the total degree in 7 is 0(A), we can update the multigraph F in 0(A) time. Furthermore, 
we can update the matrix C[-, ■] in O(A^) time, by examining each vertex pair n, v adjacent to 7 and decre- 
menting (7[n, v] if u and lie in different subcomponents. Consequently, we need to update O(A^) edges 
of type (a). The cost is ©(A^). 

Deletion of a vertex from a high component 7 in P. The component 7 is split into a number of subcom- 
ponents 71, . . . , 7£ with, say, 71 being the largest. We can update the multigraph F in time 0(deg(72) + 
• • • + deg(7£)) by splitting the smaller subcomponents from the largest subcomponent. Consequently, we 
need to update 0(deg(72) + • • • + deg(7^)) edges of type (b) in G*. Since P undergoes deletions only, a 
vertex can belong to the smaller subcomponents in at most O(lgn) splits over the entire phase, and so the 
total cost per phase is 0{m), which is absorbed in the preprocessing cost of the phase. 

For each low subcomponent 7^, we update the matrix C[-, •] in 0(deg(7j)A) time, by examining each 
edge '^jV G F and each of the 0(A) vertices u adjacent to 7^ and testing whether ^ju G F. Consequently, 
we need to update 0(deg(7j)A) edges of type (a) in G*. Since a vertex can change from being in a high 
component to a low component at most once over the entire phase, the total cost per phase is O(mA), which 
is absorbed by the preprocessing cost. 

Finale. The overall amortized cost per update operation is 0(A^ + m/A). Set A = m^/^. □ 

Note that edge insertions and deletions in G can be accomodated easily (e.g., see Lemma |2] of the next 
section). 

4 Dynamic Geometric Connectivity with Sublinear Update Time 

In this section, we investigate geometric connectivity problems: maintaining a set 5 of n objects, under 
insertions and deletions of objects, so that we can decide whether two query objects are connected in the 
intersection graph of S. (In particular, we can decide whether two query points are connected in the union 
of S by finding two objects containing the two points, via range searching, and testing connectedness for 
these two objects.) 

By the biclique-cover technique from lO, the result from the previous section immediately implies a 
dynamic connectivity method for axis-parallel boxes with 0(n^/^) update time and 0(n^/^) query time in 
any fixed dimension. 

Unfortunately, this technique is not strong enough to lead to sublinear results for other objects, as we 
have explained in the introduction. This is because (i) the size of the maintained graph, m = 0{S{n) + 
nT{n)), may be too large and (ii) the number of vertex updates triggered by an object update, 0{S{n) jn + 
Tin)), may be too large. 

We can overcome the first obstacle by using a different strategy that rebuilds the graph more often to 
keep it sparse; this is not obvious and will be described precisely later during the proof of Theorem |5] 
The second obstacle is even more critical: here, the key is to observe that although each geometric update 
requires multiple vertex updates, many of these vertex updates involves vertices of low degrees. 
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4.1 A degree-sensitive version of subgraph connectivity 

The first ingredient we need is a dynamic subgraph connectivity method that works faster when the degree 
of the updated vertex is small. Fortunately, we can prove the following lemma, which extends Theorem[T](if 
we set A = n^/^). The method follows that of Theorem [TJ but with an extra twist: not only do we classify 
components of P as high or low, but we also classify vertices of Q as high or low. 

Lemma 2. Let 1 < A < n. We can design a data structure for dynamic subgraph connectivity for a graph 
G = {y, E) with m edges, having amortized vertex update time 

0(A^ + min{m/A, deg(u)}) 

for a vertex u, query time 0(A), preprocessing time O(mA), and amortized edge update time O(A^). 

Proof. The data structure is the same as in the proof of Theorem[T] except for one difference: the definition 
of the graph G*. 

Call a vertex high if its degree exceeds m/ A, and low otherwise. Clearly, there are at most 0(A) high 
vertices. 

• We define a graph G* whose vertices are the vertices of Q and components of P: 

(a') For each high vertex u ^ Q and each vertex t> G Q, if C[u, v] > 0, then create an edge uv G G*. 

(b) For each vertex u ^ Q and high component 7 in P, if G V, then create an edge n7 G G* . 
(b') For each low vertex u ^ Q and each component 7 in P, if G F, then create an edge wy & G*. 

(c) For each u,v ^ Q, if uv G E, then create an edge uv G G*. 

We maintain G* in a data structure for dynamic connectivity with polylogarithmic-time edge updates. 

Justification. We claim that two vertices of Q are connected in the subgraph induced by the active vertices 
in G iff they are connected in G* . The "if" direction is obvious. For the "only if" direction, suppose two 
vertices u,v ^ Q w& "directly" connected in G by being adjacent to a common component 7 in P. If 7 is 
high, then edges of type (b) ensure that u and v are connected in G* . If u and v are both low, then edges of 
type (b') ensure that u and v are connected in G* . In the remaining case, at least one of the two vertices, say, 
u is high, and 7 is low; here, edges of type (a') ensure that u and v are again connected in G* . The claim 
follows by concatenation. 

Queries. Given two vertices vi and V2, if both are in Q, we can simply test whether they are connected in 
G* . If instead Vj {j G {1, 2}) is in a component 7^, then we can replace vj with any vertex of Q adjacent to 
7j in G*. If no such vertex exists, then because of type-(b') edges, 7^ can only be adjacent to high vertices 
of Q. We can exhaustively search for a high vertex in Q adjacent to 7^ in F, in 0(A) time, and replace Vj 
with such a vertex. If no such vertex exists, then 7^ is an isolated component and we can simply test whether 
vi and V2 are both in 7^. The cost is 0(A). 

Preprocessing per phase. At the beginning of each phase, the cost to preprocess the data structure is 
O(mA) as before. We can charge every update operation with an amortized cost of 0{mA/q) = O(A^). 
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Update of a high vertex u in Q. We need to update 0{q) edges of types (a') and (c), and 0(m/A) edges 
of type (b) in G*. The cost is d{q + m/A) = 0(m/A). 

Update of a low vertex n in Q. We need to update 0(A) edges of type (a'), and 0(deg(n)) edges of 
types (b), (b'), and (c) in G*. The cost is d{deg{u) + A). 

Deletion of a vertex from a low/high component 7 in P. Proceeding exactly as in the proof of Theo- 
rem [T] we can update the data structure with amortized cost O(A^). 

Edge updates. We can simulate the insertion of an edge uv by inserting a new low vertex z adjacent to 
only u and v to Q. Since the degree is 2, the cost is 0(1). We can later simulate the deletion of this edge by 
deleting the vertex z from Q. □ 

4.2 Range searching tools from geometry 

Next, we need known range searching techniques. These techniques give linear-space data sti^uctures 
(5(n) = 0(n)) that can retrieve all objects intersecting a query object in sublinear time {T{n) = 0(n^^^)) 
for many types of geometric objects. We assume that our class of geometric objects satisfies the following 
property for some constant b > — this property neatly summarizes all we need to know from geometry. 

Property 3. Given a set P ofn objects, we can form a collection C o/canonical subsets of total size 0{n), 
in 0{n) time, such that the subset of all objects of P intersecting any object z can be expressed as the union 
of disjoint subsets in a subcollection Cz of 0{n^^^) canonical subsets, in 0{n^~^) time. Furthermore, for 
every 1 < A < n, 

(i) the number of subsets in Cz of size exceeding n/A is 0(A^~^). 

(ii) the total size of all subsets in Cz of size at most n/A is 0(n/A''). 

The property is typically proved by applying a suitable "partition theorem" in a recursive manner, 
thereby forming a so-called "partition tree"; for example, see the work by Matousek |28l or the survey 
by Agarwal and Erickson [2]. Each canonical subset corresponds to a node of the partition tree (more pre- 
cisely, the subset of all objects stored at the leaves underneath the node). Matousek's results imply that 
h = l/d — eis attainable for simplices or constant-size polyhedra in R'^. (To go from simplex range search- 
ing to intersection searching, one uses multi-level partition trees; e.g., see f29l.) Further results by Agarwal 
and Matousek [3] yield b = + 1) — e for balls in R'^ and nontrivial values of b for other families of 
curved objects (semialgebraic sets of constant degree). The special case of axis-parallel boxes corresponds 
to 6 = 1. 

The specific bounds in (i) and (ii) may not be too well known, but they follow from the hierarchical 
way in which canonical subsets are constructed. For example, (ii) follows since the subsets in Cz of size at 
most n/A are contained in 0(A^^^) subsets of size 0(n/A). In fact, (multi-level) partition trees guarantee 
a stronger inequality, '^Cf^Cz \^\^^^ — 0{n}^^), from which both (i) and (ii) can be obtained after a 
moment's thought. 

As an illustration, we can use the above property to develop a data structure for a special case of dynamic 
geometric connectivity where insertions are done in "blocks" but arbitrary deletions are to be supported. 
Although the insertion time is at least linear, the result is good if the block size s is sufficiently large. This 
subroutine will make up a part of the final solution. 
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Lemma 4. We can maintain the connected components among a set S of objects in a data structure that 
supports insertion of a block of s objects in 0{n + s-n}~^) amortized time (s < n), and deletion of a single 
object in 0(1) amortized time. 

Proof. We maintain a multigraph in a data structure for dynamic connectivity with polylogarithmic edge 
update time (which exphcitly maintains the connected components), where the vertices aie the objects of S. 
This multigraph will obey the invariant that two objects are geometrically connected iff they are connected 
in S. We do not insist that H has linear size. 

Insertion of a block B to S. We first form a collection C of canonical subsets ior S [J B by Property [3] 
For each z ^ B and each C € C^, we assign z to C. For each canonical subset C G C, if C is assigned 
at least one object of B, then we create new edges in H linking all objects of C and all objects assigned to 
C in a path. (If this path overlaps with previous paths, we create multiple copies of edges.) The number of 
edges inserted is thus 0(n + \B\n^~^). 

Justification. The invariant is satisfied since all objects in a canonical subset C intersect all objects as- 
signed to C, and are thus all connected if there is at least one object assigned to C. 

Deletion of an object z from S. For each canonical subset C containing or assigned the object z, we need 
to delete at most 2 edges and insert 1 edge to maintain the path. As soon as the path contains no object 
assigned to C, we delete all the edges in the path. Since the length of the path can only decrease over the 
entire update sequence, the total number of such edge updates is proportional to the initial length of the path. 
We can charge the cost to edge insertions. □ 

4.3 Putting it together 

We are finally ready to present our sublinear result for dynamic geometric connectivity. We again need the 
idea of rebuilding periodically, and splitting smaller sets from larger ones. In addition to the graph H (of 
superlinear size) from LemmaHJ which undergoes insertions only in blocks, the key lies in the definition of 
another subtly crafted intermediate graph G (of linear size), maintained this time by the subgraph connec- 
tivity structure of Lemma |2l The definition of this graph involves multiple types of vertices and edges. The 
details of the analysis and the setting of parameters get more interesting. 

Theorem 5. Assume < 6 < 1/2. We can maintain a collection of objects in amortized update time 
0{n^^^ /{2+'')^ answer connectivity queries in time 0{n^^^'^^^^). 

Proof. We divide the update sequence into phases, each consisting of y := updates. The current objects 
are partitioned into two sets X and Y, where X undergoes only deletions and Y undergoes both insertions 
and deletions. Each insertion is done to Y. At the end of each phase, we move the elements of y to X and 
reset Y to the empty set. This way, |y| is kept at most y at all times. 

At the beginning of each phase, we form a collection C of canonical subsets for X by Property |3] 

The data structure. 

• We maintain the components of X in the data structure from Lemma |4] 

• We maintain the following graph G for dynamic subgraph connectivity, where the vertices are objects 
of X UY, components of X, and the canonical subsets of the current phase: 
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(a) Create an edge in G between each component of X and each of its objects. 

(b) Create an edge in G between each canonical subset and each of its objects in X. 

(c) Create an edge in G between each object z G y and each canonical subset C G C^. Here, we 
assign z to C. 

(d) Create an edge in G between every two intersecting objects in Y. 

(e) We make a canonical subset active in G iff it is assigned at least one object in Y. Vertices that 
are objects or components are always active. 

Note that there are 0{n) edges of types (a) and (b), 0{yn^~^) edges of type (c), and 0{y'^) edges of 
type (d). For y = n^, the size of G is thus 0(n). 

Justification. We claim that two objects are geometrically connected in X U y iff they are connected in 
the subgraph induced by the active vertices in the graph G. The "only if" direction is obvious. For the "if" 
direction, we note that all objects in an active canonical subset C intersect all objects assigned to C and are 
thus all connected. 

Queries. We answer a query by querying in the graph G. The cost is 0(A). 

Preprocessing per ptiase. Before a new phase begins, we need to update the components in X as we 
move all elements of y to X (a block insertion). By Lemma|4j the cost is 0{n + yv}^^) = 0{n). We can 
now reinitialize the graph G containing 0{n) edges of types (a) and (b) in O(nA) time by Lemma |2l We 
can charge every update operation an amortized cost of 0{nA/y) = O(n^^^A). 

Update of an object z in Y. We need to update 0{n^^'') edges of type (c) and 0{y) edges of type (d) in 
G. The cost according to Lemma|2]is O(n^^^A^). 

Furthermore, because of (e), we may have to update the status of as many as 0(n^~^) vertices. The 
number of such vertices of degree exceeding n/A is 0(A^^^) by Property |3li), and the total degree among 
such vertices of degree at most n/A is 0(n/A^) by Property [3lii). Thus, according to Lemma[2j the cost 
of these vertex updates is ©(n^'^A^ + A^"^ • n/A + n/A'') = ©(n^^^A^ + n/A''). 

Deletion of an object z in X. We first update the components of X. By Lemma |4l the amortized cost is 
0(1). We can now update the edges of type (a) in G. The total number of such edge updates per phase is 
0{n Ig n), by always splitting smaller components from larger ones. The amortized number of edge updates 
is thus d{n/y). The amortized cost is d{{n/y)A'^) = ©(n^^^'A^). 

Finale. The overall amortized cost per update operation is 0(n^~''A^ + n/A''). Set A = ■n}'l^'^^^\ □ 

Note that we can still prove the theorem for 6 > 1/2, by handling the 0{y'^) intersections among Y (the 
type (d) edges) in a less naive way. However, we are not aware of any specific applications with 6g(1/2,1). 
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5 Offline Dynamic Geometric Connectivity 



For the special case of offline updates, we can improve the result of Section |4] for small values of 6 by a 
different method using rectangular matrix multiplication. 

Let M[ni , n2 , n^] represent the cost of multiplying a Boolean rii x 712 matrix A with a Boolean n2 x n^, 
matrix B. Let M[ni, 712, 713 | mi, ?tt,2] represent the same cost under the knowledge that the number of I's 
in A is 7721 and the number of I's in i? is m2. We can reinterpret this task in graph terms: Suppose we are 
given a tripartite graph with vertex classes Fi, V2, V3 of sizes 711,712,^3 respectively where there are mi 
edges between Vi and V2 and m2 edges between V2 and V3. Then M[7ii, 7^2, n3 | Tni, 7772] represent the cost 
of deciding, for each u ^ Vi and f G V3, whether u and v are adjacent to a common vertex in V2. 

5.1 An offline degree-sensitive version of subgraph connectivity 

We begin with an offline variant of Lemma [2l 

Lemma 6. Let I < A < q < m. We can design a data structure for offline dynamic subgraph connectivity 
for a graph G = (V, E) with m edges and n vertices, under the assumption that 0( A) vertices are classified 
as high and at most ruH edges are incident to high vertices. Updates of a low vertex u take amortized time 

0{M[A.,n,q\mH,'m]/q + deg(n)), 

updates of high vertices take amortized time 0{q), queries take time 0(A), and preprocessing takes time 
0(M[A, 77, q 1 777,// , 777,] ). 

Proof. We divide the update sequence into phases, each consisting of q low-vertex updates. The active 
vertices are partitioned into two sets P and Q, with Q C Qq, where P and Qq are static and Q undergoes 
both insertions and deletions. Each vertex insertion/deletion is done to Q. At the end of each phase, we reset 
Qo to hold all 0(A) high vertices plus the low vertices involved in the updates of the next phase, reset P to 
hold all active vertices not in Qq, and reset Q to hold all active vertices in Qo- Clearly, \Q\ < \Qq\ = 0{q). 

The data structure is the same as the one in the proof of Lemma [2l with one key difference: we only 
maintain the value C[u,v\ when u is a high vertex in Qq and u is a (high or low) vertex in Qo- Moreover, 
we do not need to distinguish between high and low components, i.e., all components are considered low. 

During preprocessing of each phase, we can now compute C[-,-] by matrix multiplication in time 
0(M[A, 77, q I 777//, 777]), since there are 0(A) choices for the high vertex u and 0{q) choices for the vertex 
V S Qo- The amortized cost per low-vertex update for this step is 0(A/[A, 77, q \ mH,m]/q). 

Updating a high vertex tt in Q now requires updating 0{q) edges of types (a') and (c) (there are no edges 
of type (b) now). The cost is 0{q). 

Updating a low vertex -u in Q requires updating 0(A) edges of type (a'), and 0(deg(?i)) edges of types 
(b') and (c) in G* . The cost is 0(deg(n)). 

Deletions in P do not occur now. □ 

5.2 Sparse and dense rectangular matrix multiplication 

Sparse matrix multiplication can be reduced to multiplying smaller dense matrices, by using a "high- 
low" trick [5]. Fact|7Ii) below can be viewed as a variant of |'6', Lemma 3.1] and a result of Yuster and 
Zwick |[38l — incidentally, this fact is sufficiently powerful to yield a simple(r) proof of Yuster and Zwick's 
sparse matrix multiplication result, when combined with known bounds on dense rectangular matrix mul- 
tiplication. Fact ITfii) below states one known bound on dense rectangular matrix multiplication which we 
will use. 
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Fact 7. 



(i) For 1 <t < mi, we have M[ni, n2, | mi, 7722] = 0(M[ni, mi/t, + m2t). 

(ii) Lef a = 0.294. If ni < min{n2,n3}", then M[ni,n2,n3] = 0(712^3). 

Proof. For (i), consider the tripartite graph setting with vertex classes Vi, V2, V3. Call a vertex in V2 high 
if it is incident to at least t vertices in Vi, and low otherwise. There are at most 0{mi/t) high vertices. 
For each u ^ Vi and f G V3, we can determine whether u and v are adjacent to a common high vertex, in 
0{M[ni,mi/t, 713]) total time. On the other hand, we can enumerate all (u, G Fi x V3 such that u and 
V are adjacent to a common low vertex, in 0{m2t) time, by examining each edge wv with {w, w) E V2 x 
and each of the at most t neighbors u G Vi of w. 

For (ii), Huang and Pan [21J have shown that ^^[71,°^, n, n] = M[n, n", n] = 0(n2). Thus, 

Mr 1 /^/f / r / 1/0^1 n rr l/cf /o/f / 1/<^1 r / 2/aN 

[ni,n2,n3j = C'( n2/n{ ■ n^i/n^ ■ M[ni,n{ ,n{ \) = 0[ n2/n{ ■ n^/n{ ■ n{ ). 

□ 



5.3 Putting it together 

We now present our offline result for dynamic geometric connectivity using Lemma |6] Although we also 
use Property [3l the design of the key graph G is quite different from the one in the proof of Theorem |5] 
For instance, the size of the graph is larger (and no longer 0{n)), but the number of edges incident to high 
vertices remains linear; furthermore, each object update triggers only a constant number of vertex updates 
in the graph. All the details come together in the analysis to lead to some intriguing choices of parameters. 

Theorem 8. Assume < 6 < 1. Let a = 0.294. We can maintain a collection of objects in amortized time 
Q(j^\+a-baii\^ offline updates and answer connectivity queries in time 0{n^^°'^^'^l'^\ 

Proof. We divide the update sequence into phases, each consisting of q updates, where g is a parameter 
satisfying A < g < nj The current objects are partitioned into two sets X and Y , with F C Iq 
where X and Yq are static and Y undergoes both insertions and deletions. Each insertion/deletion is done 
to Y . At the end of each phase, we reset Iq to hold all objects involved the objects of the next phase, X to 
hold all current objects not in Yq, and Y to hold all current objects in Iq- Clearly, |y| < |lo| = 

At the beginning of each phase, we form a collection C of canonical subsets for X U Yq by Property |3] 



The data structure. 

• We maintain the components of X in the data structure from Lemma |4] 

• We maintain the following graph G for offline dynamic subgraph connectivity, where the vertices are 
objects of X U Yq, components of X, and canonical subsets of size exceeding n/A: 

(a) Create an edge in G between each component of X and each of its objects. 

(b) Create an edge in G between each canonical subset G of size exceeding n/A and each of its 
objects in X U y. 

(c) Create an edge in G between each object z ^ Yq and each canonical subset C € of size 
exceeding n/A. Here, we assign z to G. 
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(d) Create an edge in G between each object z ^ Yq and each object in the union of the canonical 
subsets in Cz of size at most n/A. 

(e) We make a canonical subset active in G iff it is assigned at least one object in Y. We make the 
vertices inXUY active, and all components active. The high vertices are precisely the canonical 
subsets of size exceeding n/A; there are 0(A) such vertices. 

Note that there are 0{n) edges of types (a) and (b), 0(gA^^^) edges of type (c) by Property Oi), 
and 0(g?7,/A^) edges of type (d) by Property [3];ii). So the graph has size m = 0{n + qn/A^) = 
0{qn/ A^), and the number of edges incident to high vertices is rriH = 0{n + qA^~^) = 0{n). 

Preprocessing per phase. Before a new phase begins, we need to update the components in X as we 
delete 0{q) vertices from and insert 0(g) vertices to X. By LemmalU the cost is 0{n + qv}^^). We can 
then determine the edges of type (a) in G in 0{n) time. We can now initialize G in 0(M[A, n,q\n, m]) 
time by Lemma[8] We can charge every update operation with an amortized cost of 0{M[A, n,q\n,m]/ q+ 
n/q + v}'^). 

Update of an object z in Y. We need to make a single vertex update z in G, which has degree 0(n/A^) 
by Property |3lii). Furthermore, we may have to change the status of as many as 0(A^~*) high vertices by 
Property Hi). According to LemmalU the cost of these vertex updates is 0(M[A, n,q\n, m]/q + n/A'' + 

Finale. By Fact|2l assuming that A < (7" and q < n/t, we have M[A, n,q\n,m] = 0(M[A, n/t, q] + 
mt) = 0(ng/t + ngt/A^). Choosing t = A''/^ gives 0(ng/A''/2). 

The overall amortized cost per update operation is thus 0(n/A^/^ + A^^^g + n/g + n^~^). Set A = 

and q = n^+^^^wa and the result follows. (Note that indeed A < q < n/A^^^ and q < n/t for these 
choices of parameters.) □ 

Compared to Theorem [51 the dependence on b of the exponent in the update bound is only 1 — G(6) 
rather than 1 — 0(6^). The bound is better, for example, for 6 < 1/4. 

6 Open Problems 

Our work opens up many interesting directions for further research. For subgraph connectivity, an obvious 
question is whether the 0(m^/'^) vertex-update bound can be improved (without or with FMM); as we 
have mentioned, improvements beyond ^/rn without FMM are not possible without a breakthrough on the 
triangle-finding problem. An intriguing question is whether for dense graphs we can achieve update time 
sublinear in n, i.e., 0(n^^^) (or possibly even sublinear in the degree). 

For geometric connectivity, it would be desirable to determine the best update bounds for specific shapes 
such as line segments and disks in two dimensions. Also, directed settings of geometric connectivity arise 
in applications and are worth studying; for example, when sensors' transmission ranges are balls of different 
radii or wedges, a sensor may lie in another sensor's range without the reverse being true. 

For both subgraph and geometric connectivity, we can reduce the query time at the expense of increasing 
the update time, but we do not know whether constant or polylogarithmic query time is possible with sublin- 
ear update time in general (see IB for a result on the 2-dimensional orthogonal special case). Currently, we 
do not know how to obtain our update bounds with linear space (e.g.. Theorem [T] requires 0{m^^^) space). 
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nor do we know how to get good worst-case update bounds (since the known polylogarithmic resuhs for 
connectivity under edge updates are all amortized). Also, the queries we have considered are about connec- 
tivity between two vertices/objects. Can nontrivial results be obtained for richer queries such as counting 
the number of connected components (see UJ on the 2-dimensional orthogonal case), or perhaps shortest 
paths or minimum cut? 
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